An Evaluation Protocol for the Subjective Assessment of Text-to-Speech in Audiobook Reading Tasks
نویسندگان
چکیده
This paper presents an evaluation protocol for the subjective assessement of text-to-speech in audiobook reading tasks. We developed a questionaire with 11 scales an tested it on TTS data from 4 different synthetic voices, plus one optimized version. A MANOVA on the data gathered with the questionnaire showed that the text type has a significant influence on 7 of the 11 scales. Moreover, the level of familiarity does not have any influence on the ratings. A subsequent Principal Axis Factor (PAF) analysis with Promax rotation resulted in 2 underlying dimensions. The first factor represents the listening pleasure the tested systems achieved. The second dimension comprises scales that evaluate the prosody of the synthesized speech signal. After the analysis of the results we propose to perform slight modifications to the developed questionaire.
منابع مشابه
Reading Performance of Iranian EFL Learners in Paper and Digital texts
Dependence on computers and internet has given birth to digital literacy. However, research into its influences on the reading process is still in its infancy. To fill the gap, this study was designed to investigate the ways in which text presentation mode (paper vs. digital) affects reading comprehension, as well as reading attitudes. To this end, a sample of 30 male and female English major s...
متن کاملجُستاری در رویکرد دیالکتیکی به «خواندن»
Purpose: This article tries to explain that reading is a dialectical action. For this purpose, it refers to the concept of dialectics in ancient times and, with a glance at the concepts of man, world, science, language and knowledge, it tries to discuss the dialectical status of reading. Method: In the present article, a conceptual analysis approach has been used. This approach that is used i...
متن کاملTowards Perceptual Quality Modeling of Synthesized Audiobooks – Blizzard Challenge
This paper reports on recent advances in the field of instrumental quality evaluation of text-to-speech (TTS) synthesis. In particular, a wide range of acoustic quality markers are analyzed concerning their quality-describing power using the audiobook data from the Blizzard Challenge 2012. Several approaches for perceptual modeling are investigated and compared with each other. The results reve...
متن کاملThe USTC System for Blizzard Challenge 2013
This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2013. There are two evaluation tasks in this year: the English audiobook tasks and the pilot tasks on 4 Indian languages. According to the various amount of training data, different speech synthesis systems are constructed. The hidden Markov model (HMM) based unit selection and waveform concatenation syst...
متن کاملThe Effect of Variations in Integrated Writing Tasks and Proficiency Level on Features of Written Discourse Generated by Iranian EFL Learners
In recent years, a number of large-scale writing assessments (e.g., TOEFL iBT) have employed integrated writing tests to measure test takers’ academic writing ability. Using a quantitative method, the current study examined how written textual features and use of source material(s) varied across two types of text-based integrated writing tasks (i.e., listening-to-write vs. reading-to-write) and...
متن کامل